Teleconferencing method and system for providing face-to-face, non-animated teleconference environment

ABSTRACT

A method and system provides a face-to-face video conference utilizing a video mirror. The method and apparatus comprise a first station having a first predetermined sensory setting; a second station having a second predetermined sensory setting; and an imaging system for capturing an image or sub-image at the first station, displaying at least a portion of the image or sub-image at the second station such that it becomes generally visually integrated with the second predetermined sensory setting. Also, disclosed is apparatus and method for effecting a face-to-face presence environment regardless of whether the first and second predetermined sensory settings are the same or different. The stations may be portable and/or modular such that they can be easily constructed or assembled. The stations may also be architectured and/or decorated to further enhance the face-to-face environment created by the video conferencing system and method.

RELATED APPLICATION

This application is a Continuation of application Ser. No. 08/308,603filed Sep. 19, 1994, issued as U.S. Pat. No. 5,572,248.

BACKGROUND OF THE INVENTION

The present invention is related to a video conferencing system andmethod and, more particularly, to a teleconferencing system which iscapable of producing a "video mirror" at a station such that anyparticipants at one or more remote stations may be imaged and displayedin the video mirror at the station so that they appear to be present orface-to-face with any participants at the station.

Visual telephone systems presently provide communication between atleast two locations for allowing a video conference among participantssituated at each station. An objective in some video conferencingarrangements is to provide a plurality of television cameras at onelocation. The outputs of those cameras are transmitted along with audiosignals to a corresponding plurality of television monitors at a secondlocation such that the participants at the first location are perceivedto be present or face-to-face with participants at the second location.In achieving good face-to-face presence, the number of confereesincluded in the video picture from each camera is normally limited to afew people, typically one to four. There are usually a like number ofmonitors at the receiving station, each strategically focused, alignedand positioned so that their displays appear contiguous, seamless andproperly aligned. The apparatuses and 2 methods employed heretofore toachieve proper positioning, focus and alignment have been complex andcostly.

Further, the images captured by the plurality of cameras must bearranged and displayed so that they generate a non-overlapping and/orcontiguous field of view, for example, as described in U. S. Pat. No.4,890,314 which issued to Judd et al. on Dec. 26, 1989 and which ishereby incorporated by reference and made a part hereof.

The prior art systems have also been deficient because they have failedto provide means for generating an image, such as an image of aplurality of participants, at one station, differentiating the image toprovide a differentiated image and subsequently compositing thedifferentiated image with a predetermined composite image to provide acomposited image which complements or becomes visually complementary,contiguous or integrated with the remote station when the image isdisplayed at the remote station.

Another problem with prior art video conferencing systems is eye contactamong participants at the stations. Typically, a camera is placedsomewhere above the display monitor at which a participant is observinga display of the participant from the remote station. Consequently, thecamera captures the participant at an angle above the participantsviewing level or head. Thus, when an image of that participant isdisplayed at the remote station, it appears as if the participant islooking down (e.g., towards the ground). Previous solutions to thisproblem have required complex optical systems and methods using, forexample, a plurality of lenses and mirrors. The solutions have usuallybeen designed for use when the camera is capturing an image of a singleparticipant, and they fall short when simultaneously capturing images ofmultiple participants.

The prior art stations themselves were not architecturally designed in amodular form so that they could be easily assembled, decorated andcombined with a video image or sub-image from the remote station in amanner which would enhance the virtual presence environment.

SUMMARY OF THE INVENTION

It is, therefore, a primary object of the present invention to provide aface-to-face teleconferencing system which enables a plurality ofparticipants at a plurality of stations to teleconference such that theparticipants generally appear face-to-face with one or more participantsat remote stations in the teleconferencing system.

Another object of this invention is to provide a differentiator ordifferentiating means which facilitates differentiating at least oneimage captured at a station into a differentiated image which willultimately be transmitted to at least one remote station.

Another object of this invention is to provide a method and system forcompositing an image or sub-image received from a remote station with apredetermined composite image to provide a composited image, at least aportion of which is displayed at the station.

Still another object of the invention is to provide a system or methodwhich provides a display having wide aspect ratio while utilizingcameras which generate images having smaller aspect ratios.

Still another object of the invention is to provide a method and systemfor defining a predetermined sensory setting at one or more stations inorder to enhance the virtual presence environment at that station.

Still another object of the present invention is to provide a method andapparatus for imaging subjects at one station, processing such images,and displaying such images at a remote station such that such imagescomplement and/or become visually integrated with the remote station.

Another object of this invention is to provide a method and apparatuswhich is capable of generating a composite image having a plurality ofdifferent resolutions.

Still another object of the present invention is to provide a "videomirror" at a station.

Yet another object of the invention is to provide an imaging systemwhich provides a simplified means capturing substantially eye levelimages of participants at stations while also providing means forsimultaneously displaying images at such stations.

Still another object of this invention is to provide a system and methodfor compositing a plurality of signals corresponding to a plurality ofimages from at least one station to provide a contiguous or seamlesscomposite image.

Still another object is to provide a method and system for providing aplurality of teleconferencing stations that have complementarypredetermined sensory settings which facilitate creating a face-to-faceenvironment when images of such settings and participants are displayedat remote stations.

Another object of the invention is to provide a method and apparatus forgenerating a video mirror such that an image having a predeterminedsensory setting of participants or subjects captured at one station maybe displayed at a remote station having a different predeterminedsensory setting, yet the remote participants will appear face-to-face inthe same predetermined setting as the participants or subjects at theone station.

In one aspect, this invention comprises an image generator for use in ateleconferencing system comprising a differentiator for comparing adifferential reference image to an input video image from a station andfor generating a differential image in response thereto, and acompositor associated with a remote station for receiving thedifferential image and for combining that differential image with apredetermined composite image to provide a composite image.

In another aspect, this invention comprises a conferencing systemcomprising a first station comprising a first sensory area defining afirst aura, a second station comprising a second sensory area defining asecond aura, and an image system for generating a first station image ofat least a portion of the first sensory area and also for displaying acomposite image corresponding to the first station image at the secondstation such that the first and second auras become visually combined toprovide an integrated face-to-face environment at the second station.

In another aspect, this invention comprises an image system for use in aconference environment comprising a station having a first conferencearea and a remote station having a remote video area, the image systemcomprising a compositor for compositing a first signal which generallycorresponds to a video image of a portion of the first conference areawith a composite reference signal to provide a composite image signal;and a display for displaying the composited image signal at the remotevideo area such that the first and second stations appearcomplementarily integrated.

In still another aspect, of the invention, this invention comprises ateleconferencing system comprising a sensory setting, a second stationhaving a second predetermined sensory setting; and an imaging system forcapturing an image at the first station and displaying at least aportion of the image at the second station such that it becomesgenerally visually integrated with the second predetermined sensorysetting.

In another aspect of this invention, this invention comprises a stationfor use in a teleconferencing environment comprising a first stationpredetermined setting, first image sensing means associated with thefirst station predetermined setting for capturing images at the stationfor transmission to a remote station, audio means for transmittingand/or receiving audio signals from at least one remote station, anddisplay means for displaying an image including at least one sub-imagetransmitted to the station from the remote station so that the imagebecomes integrated with the first station predetermined setting tofacilitate providing a face-to-face presence teleconference.

In still another aspect of the invention, this invention comprises amethod for providing a virtual presence conference in a teleconferencingsystem having a first station and a second station comprising the stepof displaying an image formed from at least one sub-image from the firststation at a predetermined location in the second station such that theimage becomes visually integrated with the second station to define asingle predetermined aura at the second station.

In yet another aspect of the invention, this invention comprises amethod for teleconferencing comprising the steps of teleconnecting afirst station having a first setting to a second station having a secondsetting; and displaying a composite image including an image of at leasta portion of the first station at the second station such that when thecomposite image is displayed at the second station it cooperates withthe second setting to facilitate providing a face-to-face environment atthe second station.

In still another aspect, this invention comprises a method forteleconferencing comprising generating at least one first station signalgenerally corresponding to a first station image of the first station,comparing the at least one first station signal to a differentialreference signal corresponding to a first reference image and generatingat least one differential signal comprising a portion of the firststation image in response thereto, compositing the at least onedifferential signal with a predetermined composite signal correspondingto a predetermined image to provide at least one composite image, anddisplaying the at least one composite image corresponding to thecomposite signal at a second station.

In yet another aspect, this invention comprises a method for generatinga seamless image at a station from a plurality of sub-images at leastone of which is received from a remote station comprising the steps ofgenerating the plurality of sub-images, and combining the plurality ofsub-images with a predetermined composite image to provide the seamlessimage.

These advantages and objects, and others, may be more readily understoodin connection with the following specification, claims and drawings.

BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS

FIGS. 1A and 1B, taken together, show a teleconferencing systemaccording to one embodiment of this invention;

FIG. 2 is a partly broken away top view of a first station of theteleconferencing system shown in Fig. 1A;

FIGS. 3A and 3B, taken together, show another embodiment of the presentinvention wherein the stations have different predetermined sensorysettings;

FIGS. 4A and 4B, taken together, show still another embodiment of theinvention having stations which have predetermined sensory settingswhich are designed, decorated and defined to be complementary and/orsubstantially identical;

FIGS. 5A and 5B, taken together, provide a visual illustration of theimages corresponding to some of the signals generated by theteleconferencing system; and

FIGS. 6A-6D, taken together, show a schematic diagram of a methodaccording to an embodiment of this invention.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENT

Referring now to FIGS. 1A and 1B, a teleconferencing system 10 is shownhaving a first station or suite 12 and a second station or suite 14. Thefirst station 12 comprises a first conference or sensory area 16, andthe second station 14 comprises a second conference or sensory area18-1, respectively. The first and second stations 12 and 14 alsocomprise a first video area 20 and a second video area 22-1,respectively, associated with the first and second conference areas 16and 18-1. The first video area 20 is generally integral with a wall 32hin the first station 12. Likewise, the second video area 22-1 isgenerally integral with a wall 32h-1 in the second station 14. In theembodiment being described, the first and second stations aregeographically remote from each other, but they could be situated on thesame premises if desired.

For ease of illustration, the construction and modular assembly of thestations in teleconferencing system 10 will be described in relation tothe first station 12. As shown in the sectional top view of FIG. 2, thefirst station 12 is shown assembled or constructed into a generallyelongated octagonal shape. The first station 12 comprises a plurality ofmodular members 32a-32h which include walls 32a, 32c-e, 32g-h, doors inwall members 32b and 32f and entry facade 32f-32l . The first station 12also comprises a ceiling 34 (FIG. 1A) which is mounted on the members32a-32h with suitable fasteners, such as nuts, bolts, adhesives,brackets, or any other suitable fastening means. Notice that the ceiling34 has a dropped or sunken portion 34a which supports appropriatelighting fixtures 56.

In the embodiment being described, each of the members 32a-32h and theceiling 34 is molded or formed to provide or define an environmenthaving a unique architectural setting and/or sensory setting. Forexample, as illustrated in FIG. 1A, the wall member 32a may be formed toprovide a plurality of stones 36, a plurality of columns 38, and an arch40 to facilitate defining a first predetermined setting 12a having aRoman/Italian motif, theme or aura. One or more of the members 32a-32hmay be provided with inlays, wall decorations (like picture 58 in FIGS.1A and 2), or even a permanent frosted glass window and framearrangement 42 mounted therein. Furthermore, members 32b and 32f (FIG.2) may be provided with sliding doors 44 which facilitate entering andexiting the first station 12 and which are designed to complement orfurther enhance the Roman/Italian motif.

In the embodiment being described, notice that member 32h (FIGS. 1A and2) is formed to provide a stone and pillar appearance and texturecomplementary to the stone and pillar appearance and texture of the wallmembers, such as member 32a. Also, the member 32a may be shaped to frameor mask a rear projection screen 46, as shown. The function andoperation of the rear projection screen 46 will be described laterherein. In the embodiment being described, the rear projection screen 46comprises a high resolution lenticular rear projection screen which iseither integral with or mounted directly to member 32h to provide afirst video area 20 having a usable projection area of about 52 inchesby 92 inches with an associated aspect ratio of 16:9.

Each of the members 32a-32h and ceiling 34 are created in separatemodular units using a plurality of molds (not shown). In the embodimentbeing described, a suitable material for molding the members 32a-32h andceiling 34 to provide a granite-like appearance may be Gypsum, but theycould be formed from other suitable material such as stone or clay-basedmaterials, ceramic, paper, cardboard, foam, wood, Styrofoam and thelike. As illustrated in FIGS. 1A and 2, the member 32d may be providedwith a shelf or mantle 33. The various members 32a-32h are assembledtogether as shown in FIG. 2 and secured together with suitable supportbraces 48 which may be secured to the walls 32a-32h with any suitablefastener such as screws, bolts, an adhesive or the like. After the firststation 12 is assembled and the ceiling 34 is secured thereto, it has alength of about 14 feet, 6 inches (indicated by double arrow L in FIG.2) and a width of about 12 feet, 0 inches (indicated by double arrow Win FIG. 2). The first station 12 has an approximate height from floor toceiling 34 of about 8 feet, 6 inches. Further, the members 32a, 32c, 32eand 32g have a width (indicated by double arrow Y in FIG. 2) of about 5feet, 0 inch. Finally, the back wall member 32d and front wall member32h comprises a width of about 7 feet, 8 inches (indicated by doublearrow X in FIG. 2).

After the members 32a-32h and ceiling 34 are assembled, the firststation 12 may be further decorated, designed or ornamented with aplurality of subjects, decorations or ornaments which facilitateproviding the first predetermined sensory setting 12a which defines afirst aura, motif or theme. Likewise, the second station 14 maybefurther provided or ornamented with a plurality of subjects, decorationsor ornaments which facilitate providing a second predetermined sensorysetting 14a which defines a second aura, motif or theme. For example, asillustrated in FIG. 1A, the predetermined sensory setting 12a of thefirst station 12 may be further decorated with a table 50, tabledecorations, pillar and wall decorations, carpet (not shown), plants 54and other wall decorations (not shown) to further enhance theRoman/Italian motif, theme or aura. The first and second predeterminedsensory settings 12a and 14a may also comprise appropriate lightingfixtures 56 and appropriate furnishings, such as chairs 60 and tables61, which complement the predetermined setting to further facilitatedefining the Roman/Italian theme or motif for the stations 12 and 14.

It should be appreciated that once the first and second stations 12 and14 are assembled and ornamented or decorated to provide their respectivefirst and second predetermined sensory settings 12a and 14a, they definean aura, theme or motif which facilitates providing or creating a verysensual and impressionable environment. Providing such a station, suchas station 12, with a strong sensory environment facilitates enhancingthe virtual presence illusion created by teleconferencing system 10 ofthe present invention.

It should also be appreciated, however, that although the first station12 and second station 14 are shown in the embodiment in FIGS. 1A and 1Bas having complementary or similar first and second predeterminedsensory settings 12a and 14a, they could be provided with first andsecond predetermined sensory settings 12a and 14a having differentthemes, motifs or auras. Thus, while the embodiment described inrelation to FIGS. 1A and 1B illustrate a first and second set ofstations 12 and 14 having a Roman/Italian motif, another set ofstations, such as station 12' and station 14' in the embodimentillustrated in FIGS. 3A and 3B, may have at least one station having adifferent predetermined setting. For example, the second station 14' inFIG. 3B provides a setting 14a' which defines a Chinese aura, theme ormotif.

It should also be appreciated that the members 32a-32h, ceiling 34 andassociated predetermined sensory setting are provided to betransportable and capable of being assembled at any suitable location,such as an existing rectangular room, suite or conference area havingdimensions of at least 20 feet×20 feet×9 feet. While it may be desirableto provide the first and second stations 12 and 14 in theteleconferencing system 10 with substantially the same dimensions, itshould be appreciated that they could be provided with differingdimensions, depending on, for example, the number of participants ateach station. It should also be appreciated that the second station 14and other stations described herein would preferably be manufactured andassembled in the same or similar manner as the first station 12. Also,the stations in the teleconference system 10 may be decorated with wall,ceiling and floor coverings to provide, for example, the firstpredetermined sensory setting 12a without using the pre-formed or moldedmodular members 32a-32h described above, although the use of suchmembers may be preferable in this embodiment.

The teleconferencing system 10 also comprises conferencing means or aconferencing system means for teleconnecting the first and secondstations 12 and 14 together to facilitate capturing an image or imagesat one of said stations and displaying at least a portion of the imageor a sub-image at another of the stations such that it becomes generallyvisually integrated with the predetermined sensory setting at thatstation, thereby facilitating creating a "video mirror" and a"face-to-face" environment for the participant situated at that station.As shown in FIG. 1A, the conferencing system associated with the firststation 12 comprises image sensor means, imager or image sensors forsensing images at the first station 12. For the embodiment shown inFIGS. 1A and 2, the image sensor means comprises a plurality of cameraswhich are operably associated with the rear projection screen 46 offirst station 12. In this regard, the plurality of cameras comprise afirst camera head 62 and second camera head 64 which are operativelycoupled to a first camera control unit 66 and second camera control unit68, respectively. Notice that the first and second camera control units66 and 68 are remotely situated from the first and second camera heads62 and 64. This facilitates permitting the first and second cameras 62and 64 to be placed directly in the projection path of the rearprojection screen 46, without substantially interfering with the videoimage being projected.

In the embodiment being described, the first camera head 62 and secondcamera head 64 are situated approximately 16 inches above the surface oftable 50 which generally corresponds to the eye level of the seatedparticipants situated at table 50. As illustrated in FIG. 2, the firstand second cameras 62 and 64 are situated behind the rear projectionscreen 46 in operative relationship with a pair of 11/4 inch diameteropenings 66 and 68, respectively. The first and second cameras 62 and 64are mounted on a suitable narrow or non-interfering bracket (not shown)such that they can be positioned behind the rear projection screen 46 inoperative relationship with openings 66 and 68, respectively. In theembodiment being described, the first and second cameras 62 and 64 are11/4 inch by 11/4 inch 3-CCD camera heads which generate images havingan aspect ratio of about 3:4 and a picture resolution of about 494×700pixels. One suitable 3-CCD camera heads 62 and 64 and associated cameracontrol units 66 and 68 may be Model No. GP-US502 manufactured byPanasonic Broadcast and Television Systems Company of Japan. It shouldbe appreciated that while the teleconferencing system 10 shown anddescribed in relation to FIGS. 1A and 1B show image sensor meanscomprising a plurality of camera heads 62 and 64 and camera controlunits 66 and 68 situated at a station, a single camera may be used (asshown and described relative to the embodiment shown in FIGS. 4A and 4B)or even multiple cameras could be used depending on such things as thesize of the station, the number of participants situated at the station,and/or the aspect ratio of each camera head selected. It should also beappreciated that the camera heads 62 and 64 and associated cameracontrol units 66 and 68 are configured and positioned at the firststation 12 to facilitate providing maximum vertical eye contact amongparticipates in the teleconference, while minimally interrupting thesubstantially life-size video projection on the rear projection screen46.

The conferencing means also comprises a first differentiator ordifferential key generator 70 (FIG. 1A) and a second differentiator ordifferential key generator 72, respectively. The camera control unit 66generates an RGB analog signal I-62 which is received by the firstdifferentiator 70, and the camera control unit 68 generates an RGBsignal I-64 which is received by the second differentiator 72. The firstand second differentiators 70 and 72 provide means for processing theimage signals generated by the camera control units 66 and 68 to removeor differentiate any undesired portion of the images corresponding tothe signals I-62 and I-64. For example, as described in detail laterherein, it is desired in this embodiment to separate the image of theparticipants situated at the first station 12 from at least a portion ofthe first predetermined sensory setting 12a, such as the backgroundbehind the participants, in order to provide a differential signal VS-1that has that portion of the first predetermined sensory setting 12Aremoved. This, in turn, facilitates transmitting the video image of theparticipants at the first station 12 to the remote second station 14 andalso facilitates compositing the image with other images, as describedbelow.

Suitable differentiators 70 and 72 may comprise the differential keygenerator shown and described in U.S. Pat. No. 4,800,432, issued on Jan.24, 1989 to Barnett et al. and assigned to The Grass Valley Group, Inc.,which is incorporated herein by reference and made a part hereof.

The differential key generators 70 and 72 convert the I-62 and I-64signals from RGB analog signals to digital image signals havingcorresponding images 104 and 106 (FIG. 5A), respectively. Thedifferential key generators 70 and 72 compare the digital image signalsto an associated differential reference signals DRS-62 and DRS-64,respectively, which generally corresponds to images 108 and 110 in FIG.5A. As described in detail later herein, these images 108 and 110comprise at least a portion of the first predetermined sensory setting12a such as the background. The differential reference signals DRS-62and DRS-64 are stored in appropriate storage 74 and 76 (FIG. 1A)associated with the differential key generators 70, 72, respectively. Inthe embodiment being described, the differential reference signalsDRS-62 and DRS-64 comprise a reference frame of a video image grabbed byone or both cameras 62 or 64 situated at the first station 12 from avideo sequence of the first predetermined sensory setting 12a of thefirst station 12 background where no participants, chairs, or otherforeground elements are in place.

In response to the comparison, the first and second differentiators 70and 72 generate differentiated video signals VS-1 and VS-2 (FIG. 1A),respectively. As illustrated in FIG. 5, the VS-1 and VS-2 signalsgenerally correspond to the individuals situated at the first station 12when viewed in the direction of arrow A in FIG. 2. As illustrated in theimages 112 and 114 (FIG. 5) associated with the VS-1 and VS-2 signals,respectively, notice that the background area shown in images 104 and106 has been removed and is tagged as a "zero" image area.

Advantageously, tagging at least a portion of the image represented bythe VS-1 signal as "zero" background facilitates compressing the VS-1and VS-2 signals and providing corresponding compressed CDS-1 and CDS-2signals, thereby reducing the amount of transmission band width needed.This tagging also facilitates compositing or overlaying anotherpredetermined image to provide a seamless composited image as describedin detail below.

The video signals VS-1 and VS-2 are received by a firstcompression/decompression means or CODEC 78 and a secondcompression/decompression means or CODEC 80, respectively. The CODECs 78and 80 also receive an audio signal AS-A1 and AS-A2 from suitablemicrophones 82 and 83, respectively, which may be positioned orconcealed at an appropriate location in the first station 12, such asunderneath or on top of table 50, as illustrated in FIG. 1A. Thefunction of the first and second CODEC 78 and 80 is to compress videoand audio signals for transmitting to remote stations, such as thesecond station 14, and also to decompress compressed video and audiosignals received from remote stations. Consequently, the CODECs 78 and80 are configured with suitable compression and decompression algorithmswhich are known to those of ordinary skill in the art. The CODEC ModelNo. Rembrandt II VP available from Compression Labs, Inc. of San Jose,Calif. is suitable for use in the embodiment described herein, but itshould be noted that other suitable compression/decompression means maybe employed.

The CODEC 78 receives the video signal VS-1 and audio signal AS-A1, andCODEC 80 receives the video signal VS-2 and audio signal AS-A2. TheCODECs 78 and 80, generate digital signals CDS-1 and CDS-2,respectively, in response thereto which are in turn transmitted toremote station 14 via a transmission network 84.

The transmission network 84 may be configured as a private network,public circuit switch service, and it may utilize telecommunicationand/or satellite technology. In the embodiment being described, thetransmission network 84 preferably includes a plurality of T-1 lines(not shown) which are capable of accommodating bit streams having asuitable band width, such as 1.544 megabytes per second.

The teleconferencing system 10 and conference means associated with thefirst station 12 also comprises enhancing means for enhancing theresolution of an image or sub-image received from a remote station, suchas the second station 14. In the embodiment being described, enhancingmeans comprises a first line doubler 86 and a second line doubler 88which are operatively coupled to the first CODEC 78 and second CODEC 80,respectively. In this embodiment, the first and second line doublers 86and 88 enhance the resolution and picture quality of at least a portionof the image corresponding to video signals VS-3 and VS-4 received fromthe CODECs 78 and 80, respectively, by about 50-1500%. The VS-3 and VS-4signals correspond to images or sub-images received from remotestation(s), such as station 14, as described in detail below. Onesuitable line doubler is the Model No. LD 100 available from FaroudjaLaboratories, Inc. of Sunnyvale, Calif., but other suitable enhancingmeans may be provided to provide greater or less enhancement of theimages to be displayed. For example, lenses, mirrors, optical pixelinterpolation or other electrical means may be employed as desired. Itshould also be noted that the present invention may be performed withoutthe use of any enhancing means without departing from the scope of theinvention.

The first and second line doublers 86 and 88 generate enhanced videosignals which are input into compositing means, compositor or videocompositing multiplexer 92 for compositing the enhanced video signalsassociated with the images or sub-images received from the remotestation(s) with one or more predetermined composite signals, such aspredetermined composite signal A, corresponding to a predeterminedcomposite image or sub-image which are stored in a suitable storagedevice 94 associated with the compositor 92. In the embodiment beingdescribed, the predetermined composite signal A corresponds to an imageof at least a portion of first predetermined sensory setting 12a, suchas the background of the first station 12. The video compositingmultiplexer 92 composites the signals received from the first and secondline doublers 86 and 88 with the predetermined composite signal A andgenerates a RGB analog composite signal in response thereto. It has beenfound that Model No. E-Space-1 available from Miranda Technologies, Inc.of Montreal and Quebec, Canada, is one suitable video compositingmultiplexer 92.

The teleconferencing system 10 comprises a projector 96 coupled to thevideo compositing multiplexer 92 which receives the RGB composite signaland projects a corresponding image 90 (FIG. 1A) corresponding to thecomposite signal on the rear projection screen 46. The Model No. 3300available from AMPRO Corporation of Titusville, Fla. has been found tobe a suitable projector 96. Although the embodiment has been describedusing projector 96 and rear projection screen 46, other suitable meansmay be employed for projecting or displaying the composited image. Forexample, a liquid crystal display (LCD) or other electronic screen maybe suitable to display images at a station. This may eliminate the needfor the projector 96.

The projector 96 could be used with an optical system or a plurality ofmirrors (not shown), or prisms (not shown) such that the projector canbe positioned, for example, to the side or below the rear projectionscreen 46 or in a manner that permits the projector 96 to project theimage towards a mirror (not shown), which causes the image to beprojected on the rear projection screen 46.

As described in detail below, the composite signal and its correspondingimage 90 generally comprise a video image of at least a portion of thefirst predetermined sensory setting 12a combined or composited with adifferentiated image, such as an image of the participants from thesecond station 14 which correspond to the VS-3 and VS-4 (FIG. 1B)signals. Consequently, the resultant image 90 projected on screen 46 atthe first station 12 complements or blends with the architectural motif,aura, theme or design defined by the first predetermined sensory setting12a at the first station 12, such that the projected image 90 appearsvisually integrated with the first predetermined sensory setting 12a ofthe first station 12. This, in turn, causes any image of theparticipants situated at the second station 14 and included in the image90 to appear to be face-to-face with participants at the first station12 during the teleconference. The operation of the compositor 92 isdescribed in more detail later herein.

It should be appreciated that the sub-images or images received from theremote station(s) typically have a resolution on the order of about352×288 pixels and the predetermined composite signal A comprises aresolution on the order of about 1280×1024 pixels. Thus, the resultantcomposite image 90 may comprise, for example, an image of theparticipants situated at the second station 14 having a first resolutionand a background image of the first station 12 having a secondresolution, which is higher than the first resolution. This enablescompositor 92 to provide a composite image 90 which, when displayed onscreen 46, gives the illusion or effect of a "video mirror" to theparticipants situated at the first station 12.

The teleconferencing system 10 also includes audio means comprising aplurality of speakers 100 and 102 (FIGS. 1A and 2) which, in turn,receive audio signals AS-B1 and AS-B2 from CODECs 78 and 80,respectively. It should be appreciated that the audio signal AS-B1 andAS-B2 generally correspond to the audio associated with the sound (e.g.,voices, music and the like) associated with the remote station(s), suchas second station 14.

It should also be appreciated that the rear projection screen 46 andprojector 96 are configured and selected to enable the teleconferencingsystem 10 to project the composited image 90 (FIG. 1A) at apredetermined scale, such as substantially full scale. In this regard,the compositor 92 comprises a scaler 95 which is integral therewith forscaling the composited signal associated with the composited image 90 toa desired or predetermined scale, such as substantially full scale.

Referring now to FIG. 1B, the second station 14 comprises similarcomponents as the first station and such like components are labelledwith the same reference numeral as their corresponding component in thefirst station 12, except that the components associated with the secondstation 14 have a "-1" designator added thereto. Such components operateand function in substantially the same manner as described above withregard to the first station 12 with the following being somedifferences. The differential reference signals DRS-3 and DRS-4 (FIG. 5)associated with the second station 14 generally correspond to an imageor sub-image of at least a portion of the second predetermined sensorysetting 14a, such as the background 98-1, of the second station 14. Suchsub-image or image may include at least a portion of the background 98-1without any participants, chairs or other foreground subjects situatedin the second station 14. Also, like the predetermined composite signalA stored in the storage 94 associated with the first station 10, apredetermined composite signal B may be stored in the storage 94-1associated with the compositor 92-1 second station 14.

The predetermined composite signal B may correspond to an image orsub-image of at least a portion of the second predetermined sensorysetting 14a of the second station 14. Such sub-image or image mayinclude, for example, an image of the walls 32a-1 to 32h-1 andconference area 18 or background of the second station 14. Notice thatin the embodiment shown in FIGS. 1A and 1B, the second station 14 has asecond predetermined sensory setting 14a which mirrors or iscomplementary to the first predetermined sensory setting 12a. Asdescribed above, however, the first and second predetermined sensorysettings 12a and 14a may be different.

A method of operating the teleconferencing system 10 will now bedescribed in relation to FIGS. 6A-6D. The modular components, such asmembers 32a to 32h and ceiling 34 for first station 10, decorations andthe like, are configured, assembled and decorated (block 99 in FIG. 6A)at a desired location to provide a conference station comprising apredetermined sensory setting defining a predetermined theme, motif oraura. As mentioned earlier herein, the theme, motif or aura may becomplementary (as shown in FIGS. 1A and 1B) or they can be completelydifferent, as shown in FIGS. 3A and 3B (described below). For ease ofillustration, it will be assumed that the stations are assembled anddecorated as shown and described relative to the embodiment in FIGS. 1Aand 1B.

Once the modular stations 12 and 14 are assembled and decorated, it maybe desired (decision point 101 in FIG. 6A) to use differentiator (e.g.,differentiator 72 in FIG. 1A). As discussed herein relative to theembodiments shown in FIGS. 4A and 4B, it may not always be desired togenerate a differential reference image, thereby making it unnecessaryto generate the differential reference signal. If differentiation isdesired, then the camera heads 62 or 64 generate at least one videoimage (block 103) of at least a portion of the first predeterminedsensory setting 12A at the first station 12. The differentiators 72 and74 grab or capture at least one differential reference image orsub-image from those images and generate (block 107) the differentialreference signals DRS-62 and DRS-64, respectively. These signals arestored in suitable storage 74 and 76 for use by the differentiators 70and 72, respectively. Likewise, cameras 62-1 and 64-1 at the secondstation 14 generate video images of at least a portion of the secondpredetermined setting 14a at the second station 14. The differentiators70-1 and 72-1 grab or capture at least one differential reference imageor sub-image from those images and generate differential referencesignals (not shown) corresponding thereto. These signals are then stored(block 109) in suitable storage 74-1 and 76-1 for use by differentialkey generators 70-1 and 72-1, respectively.

As mentioned above, it is preferred that the differential referencesignals DRS-62 and DRS-64 comprise an image of at least a portion of thefirst predetermined sensory setting 12a, such as an image of the firststation 12 without any participants, chairs or other subjects which arenot stationary during the teleconference. Likewise, it is preferred thatthe differential reference signals associated with the differentiators70-1 and 72-1 comprise at least a portion of the second predeterminedsensory setting 14a at the second station 14, such as an image of thebackground 98-1 without the participants, chairs and other subjectswhich are not stationary during the teleconference.

If differentiation of signals is not selected or at the end of thedifferentiation process, it may be desired to generate a composite image(decision point 97) for one or more of the stations. As discussed below,however, this may not always be required to achieve certain advantagesof the invention. Such predetermined composite image would preferablyinclude a substantial portion of the first predetermined sensory setting12a, including the background and/or conference area 16 of the firststation 12. If compositing is desired, then the predetermined compositesignal A is generated (block 111 in FIG. 6B). The correspondingpredetermined composite signal A may then be stored in suitable storage94. In the same manner, the predetermined composite image at the secondstation 14 and corresponding predetermined composite signal B may begenerated and stored as predetermined composite signal B in suitablestorage 94-1. In the embodiment being described, the predeterminedcomposite image associated with the second station 14 includes an imageof at least a portion of the second predetermined sensory setting 14a,including the background 98-1.

In the embodiment being described, the predetermined composite signals Aand B are generated by a suitable still camera (not shown) to provide astill image (not shown) of the station 12 or 14 being photographed. Thestill image would subsequently be scanned and digitized for storage by asuitable scanner (not shown). The still camera and scanner wouldpreferably be capable of generating images having a resolution on theorder of about 1280×1024 pixels. Thus, if compositing is performed, theresultant composite image (such as image 90 in FIG. 1A) may comprise animage having a high resolution background, for example, combined with acomparatively lower resolution image of the remote station participants.This, in turn, facilitates enhancing the "video mirror" effect wherein amimic or replication of a common architectural technique of mirroring awall of a given room which makes the overall room appear to be extendedbeyond its actual wall line.

Once the stations 12 and 14 are configured and the differentialreference signals and predetermined composite signals A and B aregenerated and stored, the first and second suites 12 and 14 may then beteleconnected (block 113) or connected by satellite or other suitablemeans via the transmission network 84.

Next, one or more participants may be situated at the first and secondstations 12 and 14. As illustrated in FIG. 2, notice that theparticipants seated at the first station 12 are situated a predetermineddistance B from a participant's side 46a of the rear projection screen46. The predetermined distance B generally corresponds to a preferred oroptimum focal distance at which optimum imaging by cameras 62 and 64 maybe performed. In the embodiment being described, it has been found thatthe predetermined distance should be about 5 feet, 6 inches. Theparticipants are situated at the second station 14 in a similar mannerand the face-to-face teleconference may then begin.

For ease of illustration, the imaging and display of first station 12participants at the second station 14 will be described. The first andsecond cameras 62 and 64 capture (block 117 in FIG. 6B) live images ofthe participants situated at the first station 12 and generatecorresponding RGB analog signals I-62 and I-64 which are received by thedifferential key generators 70 and 72, respectively. If differentiationwas selected (decision point 147 in FIG. 6C), processing continues atblock 119 otherwise it proceeds at block 123. The differential keygenerators 70 and 72 generate (block 121 in FIG. 6C) the digitaldifferential signal VS-1 and VS-2, respectively, after comparing (block119 in FIG. 6C) the I-62 and I-64 signals received from cameras 62 and64 to their respective differential reference signals DRS 62 and DRS-64which are received from storages 74 and 76.

The differential signals VS-1 and VS-2 are then received by CODECs 78and 80 which also receive the audio signals AS-A1 and AS-A2 whichcorrespond to the audio, including sounds, music and voices, associatedwith the first station 12. The CODECs 78 and 80 digitize the audiosignals AS-A1 and AS-A2, combine the audio signals with their respectivevideo signal VS-1 or VS-2, and generate (block 123) the compressed CDS-1and CDS-2 signals in response thereto. The CDS-1 and CDS-2 signals arethen transmitted (block 125) to the second station 14 via thetransmission network 84 (FIG. 1B).

The CDS-1 and CDS-2 signals are received and decompressed (block 127 inFIG. 6C) by CODECs 78-1 and 80-1, respectively, associated with thesecond station 14 to provide decompressed VS-1 and VS-2 signals. TheCODECs 78-1 and 80-1 also decompress the audio signals AS-A1 and AS-A2received from the first station 10 which are transmitted to speakers100-1 and 102-1, respectively, at the second station 14.

Substantially simultaneously with the broadcasting of the audio signalsat the second station 14, CODECs 78-1 and 80-1 decompress the CDS-1 andCDS-2 signals to provide VS-1 and VS-2 signals. The decompressed videosignals VS-1 and VS-2 are then received by line doublers 86-1 and 88-1.If it is desired to enhance the signals (decision point 129), then theline doublers 86-1 and 88-1 process or manipulate the signals (block131) in order to enhance the resolution of the image corresponding tothose signals. After the signals VS-1 and VS-2 are processed, it may bedesired to composite (decision point 133 in FIG. 6D) those signals withone or more other signals. In this illustration, for example, the videocompositor 92-1 composites images (block 135) corresponding to thosesignals with at least one predetermined composite image, such as image122 (FIG. 5B) corresponding to the predetermined composite signal Bprovided from storage 94-1 (FIG. 1B) to provide a composite signal. Asmentioned above, the composite signal generally corresponds to thecomposited image 91-1 to be displayed on the rear projection screen 46-1at the second station 14.

The compositor 92-1 may (decision point 137, block 139 in FIG. 6D) scalethe composited image to a desired scale, such as full scale, usingscaler 95-1. Thereafter, the compositor 95-1 transmits a correspondingRGB analog signal to projector 96-1 which displays (block 141) thescaled, composited image on the rear projection screen 46-1 (FIG. 1B).

The teleconference may then be continued or terminated as desired(decision point 143, block 145).

Because the composited image is substantially full scale when projectedand includes a high resolution image of at least a portion of the secondpredetermined sensory setting 14a, the image appears to blend or becomevisually integrated with the second predetermined sensory setting 14a.This, in turn, gives the participants situated at the second station 14the perception that the first station participants are present orface-to-face with them in the second station 14.

In the same or similar manner, images and signals relative to the secondstation 14 images are captured, processed and displayed at the firststation 12. So that images of the participants at the second station 14are displayed at the first station 12 such that they appear to have aface-to-face presence at the first station 12. Thus, images of thesecond station 14 participants may be differentiated and composited suchthat, when they are displayed at the first station 12, the imagecompletes or provides "the other half" of the first station 12 andbecomes generally visually integrated therewith. Although not required,it may be desirable to enhance the face-to-face presence by providing,for example, first and second predetermined sensory settings 12a and 14awhich define a dining environment wherein food or meals may be served.For example, the face-to-face presence may be further enhanced if theparticipants at both stations 12 and 14 order food and drinks fromidentical menus. Also, trained maitre-de and/or waiters may be used toactively promote the perception of a face-to-face dinner using ascripted dialog and interaction with remote participants, maitre-deand/or waiters.

Once the teleconferencing is terminated, the stations 12 and 14 may beused by the same or different participants without the need toreconstruct or reassemble the stations.

FIGS. 5A and 5B provide a visual illustration of the imagescorresponding to some of the signals described above utilizing themethod and embodiment described above. In this regard, images 104 and106 generally correspond to the actual images captured by the first andsecond cameras 62 and 64, respectively. As described above, associatedimage signals I-62 and I-64 are transmitted to the differential keygenerators 70 and 72, respectively. The differential key generators 70and 72 compare the images 104 and 106 to the images 108 and 110associated with the differential reference signals DRS-62 and DRS-64which are received from storages 74 and 76, respectively, and which werepreviously generated by cameras 62 and 64 from an identical fixed cameraposition.

As illustrated in FIG. 5A, the differential key generators 70 and 72generate differential signals VS-1 and VS-2 which have correspondingimages 112 and 114. Notice that these images 112 and 114 comprise animage of the participants which are situated at the first station 12with the background area having been removed or tagged as a "zero" area.As described herein, this "zero" area becomes "filled-in" with thedesired or predetermined composite image which may include, for example,an image of at least a portion of the predetermined setting orbackground of the second station 14. It has been found that removing aportion of the image, such as the background, by tagging it as zero, inthe manner described herein, facilitates compressing the signals VS-1and VS-2 and reducing the amount of bandwidth needed to transmit theimages over transmission network 84 and between the first and secondstations 12 and 14.

As mentioned above, the video signals VS-1 and VS-2 are fed into CODECs78 and 80 which compresses the signals along with audio signal AS-A1 andAS-A2 and generates signals CDS-1 and CDS-2. The CDS-1 and CDS-2 signalsare then transmitted, via transmission network 84, to the second station14 and received by the CODECs 78-1 and 80-1 associated with the secondstation 14. As illustrated in FIG. 5B, the CODEC 78-1 and 80-1decompresses the CDS-1 and CDS-2 signals, respectively, from the firststation 12 and feeds them into associated line doublers 86-1 and 88-1.As mentioned earlier herein, the line doublers 86-1 and 88-1 facilitateenhancing the images associated with the video signals to provideenhanced video signals EVS-1 and EVS-2 (FIG. 5B), respectively.

As stated earlier, the enhanced video signals EVS-1 and EVS-2 are thenreceived by the video compositing multiplexer 92-1 associated with thesecond station 14 wherein the signals are combined to provide anintermediate composite signal ICS having an associated intermediatecomposite signal image 120 having an aspect ratio of about 8:3.

The video compositing multiplexer 92-1 also receives the predeterminedcomposite signal B having a predetermined composite signal B image 122from storage 94-1. The video compositing multiplexer 92-1 composites orcombines the images 120 and 122 to generate the composite signal havingan associated or corresponding composite image 124 as shown in FIG. 5B.As stated earlier, the predetermined composite signal B image 122generally corresponds to at least a portion of the predetermined settingor background of the second station 14 and has an aspect ratio of 16:9.

Notice that when the predetermined composite signal B image 122 iscombined with the intermediate composite signal image 120, the videocompositing multiplexer 92-1 causes the "zero" area of the intermediatecomposite signal image 120 to be "filled in" with the predeterminedcomposite signal B image.

The composite image 124 may then be scaled to a predetermined size orscale, such as full scale, using scaler 94-1, so that the compositeimage 124 may be scaled to a substantially full scale or real-life sizeimage as desired. The composite image signal corresponding to thecomposite image 124 is transmitted to the projector 96-1 and thendisplayed on the rear projection screen 46-1 at the second station 14.As illustrated in FIGS. 1B and 5B, the composite image 124 may beappropriately framed or masked (such as with an archway 125 in FIGS. 1Band 5B) when it is projected at the second station 14 to enhance theface-to-face, real time environment.

The audio and video signals transmitted between the first and secondstations 12 and 14 may be, in this illustration, transmitted overseparate T-1 lines (not shown) in the transmission network 84 in orderto effect a substantially simultaneous and/or "real time" videoconference. Thus, in the illustration shown in FIGS. 1A and 1B, theparticipants may be geographically remotely located, yet theparticipants situated at the first station 12 will feel as if the secondstation 14 participants are located face-to-face or present with them atthe first station 12, while the participants situated at the secondstation 14 will feel as if the first station participants areface-to-face or present with them at the second station.

It should be appreciated that when the predetermined composite signal Band associated predetermined composite signal image 122 is compositedwith the intermediate composite signal and associated intermediatecomposite signal image 120, it overlays that signal to provide aseamless composite image 124, which facilitates reducing or eliminatingthe need to match up the borders or seams of the camera images with anyhigh degree of accuracy. In this regard, it is preferable that cameras62 and 64 and 62-1 and 64-1 preferably be situated such that theycapture an entire participant rather than, for example, half of aparticipant. Thus, it may be desired to position the participants in alocation such that any particular participants will not be in the fieldof view of more than one camera.

Advantageously, the invention provides an apparatus and method forproviding a video mirror at each station 12 and 14 which facilitatescreating a face-to-face and non-interrupted image of any participants inthe video conference. Because the image of the participants isdifferentiated, less transmission bandwidth, computer memory and thelike is required. Also, the differentiators and compositors of thepresent invention enable a user to create a composite image 124 (FIG.5B) having at least a portion thereof imaged at a greater resolutionthan the portion which was transmitted over transmission network 84.This facilitates reducing the effect of limitations or transmissionrestrictions of the transmission network 84 which, in turn, facilitatesincreasing the quality of images displayed at a station.

In addition, notice that the composite image 124 (FIG. 5B) may have anaspect ratio which is different from the aspect ratio of the cameras 62and 64. This enables the system and method of the present invention toutilize cameras which generate images having smaller or even largeraspect ratios. This also enables the system and method to use camerashaving standard or common aspect ratios, such as 4:3.

FIGS. 3A and 3B, when taken together, illustrate another embodiment ofthe invention. The operation and components of the embodiment shown inFIGS. 3A and 3B are substantially the same as the operation ofcomponents of the embodiment described above relative to FIGS. 1A and 1Bwith the same reference numerals being used for the same components withthe addition of single prime (') designator. Consequently thisembodiment is similar to the embodiment shown in FIGS. 1A and 1B, exceptthat the second predetermined setting 14a' in FIG. 3B and its associatedtheme, aura or motif is substantially different from the secondpredetermined setting 14a shown in FIG. 1B. In FIG. 3B, the firstpredetermined sensory setting 12a' comprises a plurality of decorations120 defining the Chinese theme, motif or aura. Also, the predeterminedcomposite signal A stored in storage 94-1' and the differentialreference signals stored in storages 74-1' and 76-1 would generallycorrespond to an image of at least a portion of that setting 14a'.

As with the illustration described above relative to FIGS. 1A and 1B,the video and audio signals would be processed in substantially the samemanner. In general, an image of the participants situated at the firststation 12' is composited by compositor 92-1' with a predeterminedcomposite image of at least a portion of the second predeterminedsensory setting 14a' of the second station 14' and projected onto therear projection screen 46-1' at the second station 14'. The firststation 12' participants appear to be face-to-face with the secondstation 14' participants because they have a relatively high resolutionvideo image behind them which complements or becomes integrated with thesecond predetermined sensory setting 14a'. Thus, as shown in FIG. 3B,the image 91-1' (FIG. 3B) of the ladies at the first station 12'includes a Chinese background which blends or complements the actualpredetermined sensory setting 14a'.

Likewise, when the image of the participants situated at the secondstation 14' is projected on the rear projection screen 46' at the firststation 12', they appear to be in the same room as the participantssituated at the first station 12' because the Roman/Italian videobackground which is seen behind the second station 14' participantsgenerally complements and becomes visually integrated with the actualRoman/Italian theme, motif or aura defined by the first predeterminedsensory setting 12' of the first station 12'.

FIGS. 4A and 4B, when taken together, illustrate another embodiment ofthe invention. The components of the embodiment shown in FIGS. 4A and 4Bwhich are substantially identical to the components in the embodimentshown in FIGS. 1A and 1B which have the same reference numerals with theaddition of a double prime (""") designators. As illustrated in FIGS. 4Aand 4B, two remote modular stations such as stations 12" and 14" may beprovided and designed to have first and second predetermined sensorysettings 12a" and 14a" which are substantially identical. Thus, as shownin FIGS. 4A and 4B, images may be captured in the manner described aboveat station 12" received by CODECs 78" and 80" and then transmitted, viatransmission 84", to associated CODECs 78-1" and 80-1", respectively.The CODECs 78-1" and 80-1" then generate a decompressed signal which maybe enhanced by line doublers 86-1" and 88-1", respectively; scaled to anappropriate scale by scaler 95-1"; and then projected by projector 96-1"onto rear projection screen 46-1".

Notice that the image comprising the second station 14" participants andsecond predetermined sensory setting 14a" is displayed on screen 46" atthe first station 12". Thus, this embodiment does not utilize thedifferentiating and compositing features of the previous embodiment, butmay still achieve a face-to-face conference environment because thesecond predetermined sensory setting 14a" is configured to be identicalto or complementary with the first predetermined sensory setting 12a".In this embodiment, entire images or sub-images of the stations 12 and14 (including images of both participants and background) are displayedat remote station(s). Because the stations 12" and 14" are assembled,decorated and designed to be complementary or identical, they appearvisually integrated to participants situated in the stations 12 and 14.Accordingly, the first and second predetermined sensory settings 12a"and 14a", including the background, are designed and arranged in ageometric fashion such that as cameras 62" and 64" capture images of theparticipants, they also capture images of the first and secondpredetermined sensory setting 12a" and 14a", respectively, at the mostadvantageous perspective for display at the remote station(s). As withprior embodiments, this causes the first station 12" participants toperceive that the second station 14" participants are situated orpresent with the first station 12" participants at the first station14". Likewise, the first station 12" participants appear to beface-to-face with the second station 14" participants at the secondstation 14" when the images associated with the first station 12" aredisplayed on screen 46-1". Consequently, by providing complementary oridentical first and second predetermined sensory settings 12a" and 14a",a face-to-face conference may be created. As with previous embodiments,it may also be desired to differentiate, enhance, composite or scale theimages as described with previous embodiments, but this is not requiredwith the embodiment being described.

Thus, it should be apparent that stations can be provided withpredetermined settings which are completely different, yet, by utilizingthe apparatus and method of the present invention, the images of theparticipants in these stations may be projected at remote stations sothat they appear to be virtually face-to-face with the remote stationparticipants at one or more remote station.

Various changes or modifications in the invention described may occur tothose skilled in the art without departing from the spirit or scope ofthe invention. For example, the screen 46 for station 12 has been shownas being integral with a portion of a wall 32h (FIGS. 1A and 2A), itcould comprise a larger or smaller portion of that wall 32h, or it couldbe provided as part of one or more other walls, or even as part of theceiling 34.

It should also be appreciated that while the embodiments have been shownand described comprising two stations, images from more than two remotestations may be displayed at a station, thereby permitting ateleconference convention among more than two stations.

Although not shown, one or more of the compositors, such as compositors12 or 12-1 (FIG. 1A) may comprise a stationary or moving image database(not shown) for providing a plurality of predetermined composite signalswhich define a particular or desired video background. For example,participants may elect to use the arched background of their proximity,choose an event-related scene, or decide to meet in a setting completelyunrelated to their site or station. For example, a station having aManhattan eatery motif may be provided with a screen configured as awindow (not shown). Certain moving video backgrounds of a busy New Yorkavenue may be deposited and displayed on the screen to give the illusionthat the participants situated at the station are dining in a popularManhattan eatery.

It should also be appreciated that while the embodiments being shown anddescribed herein refer to teleconferencing environments that havepredetermined settings and motifs or auras relating to dining, thepredetermined settings could define any type of aura, theme or motifwhich is suitable for video conferencing and in which it is desired toprovide a "real-life" or face-to-face presence illusion. For example,the apparatus and method of this invention could be used in a businesssetting, education setting, seminar setting, home environment, religioussetting, celebration setting (such as a birthday, retirement party,holiday or anniversary), or any other suitable setting as desired.

The above description of the invention is intended to be illustrativeand not limiting, and is not intended that the invention be restrictedthereto but that it be limited only by the spirit and scope of theappended claims.

What is claimed is:
 1. A video mirror system for use in a videoconference, comprising a plurality of stations comprising:a display; andan imager coupled to said display for generating a superimposed imagewhich is not a cartoon animation, said superimposed image comprising atleast a portion of one of said plurality of stations combined with animage of at least one participant from said one of said plurality ofstations and also for causing said display to display said superimposedimage such that when said superimposed image is displayed at anon-remote station having a predetermined motif during the videoconference the at least one participant appears life-size andface-to-face in the presence of a participant at the non-remote station.2. The video mirror system as recited in claim 1 wherein said imagercomprises a differentiator.
 3. The video mirror system as recited inclaim 1 wherein said imager comprises a compositor coupled to saiddifferentiator.
 4. A teleconferencing method comprising the stepsof:capturing image data corresponding to an image; processing the imagedata to provide differentiated image data, said differentiated imagedata corresponding to a portion of said image; transmitting saiddifferentiated image data to a teleconferencing station defining amotif; and displaying a non-cartoon animated differentiated imagecorresponding to said differentiated image data at said teleconferencingstation such that said image complements said motif of saidteleconferencing station so that subjects in the image appear to bephysically present at said teleconferencing station.
 5. Theteleconferencing method as recited in claim 4 wherein said imagecomprises a portion which is desired to be removed from said image priorto said transmitting step, said method further comprising the stepof:differentiating said portion from said image prior to saidtransmitting step.
 6. The teleconferencing method as recited in claim 5wherein said portion is a background.
 7. The teleconferencing method asrecited in claim 4 wherein said method further comprises the stepof:compressing said differentiated image data to provide compressedimage data prior to said transmitting step.
 8. The teleconferencingmethod as recited in claim 5 wherein said method further comprises thestep of:compressing said differentiated image data prior to saidtransmitting step.
 9. The teleconferencing method as recited in claim 7wherein said method further comprises the step of:decompressing saidcompressed image data at said teleconferencing station.
 10. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the steps of:combining said differentiated image datawith a second set of data corresponding to a second image to providecombined image data; displaying a combined image corresponding to saidcombined image data at said teleconferencing station.
 11. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:combining said differentiated image datawith a second set of data corresponding to a second image to providecombined image data, said second image having a resolution which ishigher than said image.
 12. The teleconferencing method as recited inclaim 4 wherein said method further comprises the step of:combining saiddifferentiated image data with a second set of data corresponding to abackground of said teleconferencing station.
 13. The teleconferencingmethod as recited in claim 4 wherein said method further comprises thesteps of:capturing said image data at a remote station; differentiatingsaid image data to remove a portion of the image.
 14. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the steps of:capturing said image data at a remotestation; differentiating said image data to remove a background in theimage.
 15. The teleconferencing method as recited in claim 4 whereinsaid method further comprises the step of:displaying said differentiatedimage corresponding to said differentiated image data on arear-projection screen at said teleconferencing station.
 16. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:displaying said differentiated imagecorresponding to said differentiated image data at said teleconferencingstation, wherein said teleconferencing station comprises a screen havinga participant table situated in proximity therewith, said participanttable being configured to cause said participants to be situated apredetermined distance from said screen.
 17. The teleconferencing methodas recited in claim 16 wherein said screen is a rear-projection screen.18. The teleconferencing method as recited in claim 16 wherein saidpredetermined distance is not less than about 5 feet.
 19. Theteleconferencing method as recited in claim 8 wherein said participanttable comprises a convex edge opposed relationship to said screen. 20.The teleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:enhancing the differentiated image at saidteleconferencing station from a first resolution to a second resolution,wherein said second resolution is higher than said first resolution. 21.The teleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:displaying said differentiated image suchthat subjects in said image appear at substantially full scale.
 22. Theteleconferencing method as recited in claim 21 wherein said methodfurther comprises the step of:enhancing the differentiated image at saidteleconferencing station from a first resolution to a second resolution,wherein said second resolution is higher than said first resolution. 23.The teleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:displaying said differentiated image suchthat subjects in said image appear substantially full scale.
 24. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the step of:transmitting said differentiated imagedata at a rate of at least 1.544 megabytes per second.
 25. Theteleconferencing method as recited in claim 4 wherein said methodfurther comprises the steps of:decorating said teleconferencing stationto comprise a predetermined motif.
 26. The teleconferencing method asrecited in claim 25 wherein said method further comprises the stepof:decorating said teleconferencing station to comprise a roman motif.27. The teleconferencing method as recited in claim 4 wherein saidmethod further comprises the steps of:situating a plurality of subjectsin said teleconferencing station to define a predetermined sensorysetting.
 28. The teleconferencing method as recited in claim 27 whereinsaid method further comprises the step of:providing said plurality ofsubjects to comprise at least one of the following: a pillar, a plant, atable, a wall decoration or a carpet.
 29. The teleconferencing method asrecited in claim 4 wherein said method further comprises the stepof:displaying said differentiated image at a teleconferencing stationhaving dimensions of at least 20 feet×20 feet×9 feet.
 30. Ateleconferencing method comprising the steps of:generating image datacorresponding to an image, said image not being a cartoon animation;transmitting at least a portion of said image data corresponding to atleast a portion of said image to a teleconferencing station defining amotif; and displaying said at least a portion of said image at saidteleconferencing station during a teleconference such that at least aportion of said image complements said motif and objects in said imageappear to be in the presence of participants situated at theteleconferencing station.
 31. The teleconferencing method as recited inclaim 30 wherein said method further comprises the stepof:differentiating said image data to provide said at least a portion ofsaid image data.
 32. The teleconferencing method as recited in claim 30wherein said method further comprises the step of:processing said imagedata to provide differentiated image data, said differentiated imagedata corresponding to an unwanted portion of said image.
 33. Theteleconferencing method as recited in claim 30 wherein said imagecomprises an unwanted portion which is desired to be removed from saidimage prior to said transmitting step, said method further comprisingthe step of:differentiating said unwanted portion from said image priorto said transmitting step.
 34. The teleconferencing method as recited inclaim 30 wherein said objects comprises at least one participant. 35.The teleconferencing method as recited in claim 33 wherein said unwantedportion of said image is a background.
 36. The teleconferencing methodas recited in claim 32 wherein said method further comprises the stepsof:compressing said differentiated image data to provide compressedimage data prior to said transmitting step.
 37. The teleconferencingmethod as recited in claim 30 wherein said method further comprises thestep of:compressing said at least a portion of said image data toprovide compressed image data prior to said transmitting step.
 38. Theteleconferencing method as recited in claim 36 wherein said methodfurther comprises the step of:decompressing said compressed image dataat said teleconferencing station.
 39. The teleconferencing method asrecited in claim 30 wherein said method further comprises the stepsof:combining said at least a portion of said image data with a secondset of data corresponding to a second image to provide combined imagedata; displaying a combined image corresponding to said combined imagedata at said teleconferencing station.
 40. The teleconferencing methodas recited in claim 32 wherein said method further comprises the stepof:combining said differentiated image data with a second set of datacorresponding to a second image to provide combined image data, saidsecond image having a resolution which is higher than a resolution ofsaid differentiated image.
 41. The teleconferencing method as recited inclaim 30 wherein said method further comprises the step of:combiningsaid at least a portion of said image data with a second set of datacorresponding to a background of said teleconferencing station.
 42. Theteleconferencing method as recited in claim 30 wherein said methodfurther comprises the steps of:capturing said image at a remote station;removing an unwanted portion of said image prior to said transmittingstep.
 43. The teleconferencing method as recited in claim 33 whereinsaid method further comprises the steps of:capturing said image at aremote station; removing said unwanted portion of said image prior tosaid transmitting step.
 44. The teleconferencing method as recited inclaim 30 wherein said method further comprises the step of:displayingsaid at least a portion of said image on a rear-projection screen at ateleconferencing station.
 45. The teleconferencing method as recited inclaim 30 wherein said method further comprises the step of:displayingsaid at least a portion of said image at said teleconferencing station,wherein said teleconferencing station comprises a screen having aparticipant table situated in proximity therewith, said participanttable being configured to cause said participants to be situated apredetermined distance from said screen.
 46. The teleconferencing methodas recited in claim 45 wherein said screen is a rear-projection screen.47. The teleconferencing method as recited in claim 45 wherein saidpredetermined distance is not less than about 5 feet 6 inches.
 48. Theteleconferencing method as recited in claim 45 wherein said participanttable comprises a convex edge in opposed relationship to said screen.49. The teleconferencing method as recited in claim 30 wherein saidmethod further comprises the step of:enhancing said at least a portionof said image displayed at said teleconferencing station from a firstresolution to a second resolution, wherein said second resolution ishigher than said first resolution.
 50. The teleconferencing method asrecited in claim 30 wherein said method further comprises the stepof:displaying said at least a portion of said image such that subjectsin said image appear at substantially full scale at saidteleconferencing station.
 51. The teleconferencing method as recited inclaim 30 wherein said method further comprises the step of:transmittingsaid at least a portion of said image data at a rate of at least 1.5megabytes per second.
 52. The teleconferencing method as recited inclaim 30 wherein said method further comprises the steps of:decoratingsaid teleconferencing station to comprise a predetermined motif.
 53. Theteleconferencing method as recited in claim 52 wherein said methodfurther comprises the step of:decorating said teleconferencing stationto comprise a roman motif.
 54. The teleconferencing method as recited inclaim 30 wherein said method further comprises the step of:situating aplurality of subjects at said teleconferencing station to define apredetermined sensory setting.
 55. The teleconferencing method asrecited in claim 54 wherein said method further comprises the stepof:providing said plurality of subjects to comprise at least one of thefollowing: a pillar, a plant, a table, a wall decoration or a carpet.56. The teleconferencing method as recited in claim 30 wherein saidmethod further comprises the step of:displaying said at least a portionof said image in a teleconferencing station having dimensions of atleast 20 feet×20 feet×9 feet.
 57. The teleconferencing method as recitedin claim 30 wherein said method further comprises the step of:situatinga camera behind a teleconferencing screen at a remote station; capturingsaid at least a portion of said image at said remote station through anopening in said teleconferencing screen.
 58. The teleconferencing methodas recited in claim 57 wherein said capturing step further comprises thestep of:capturing said at least a portion of said image using twocameras.
 59. A teleconferencing system comprising:generating means forgenerating image data corresponding to an image; transmitting meanscoupled to said generating means for transmitting at least a portion ofsaid image data corresponding to at least a portion of said image to ateleconferencing station defining a motif; and display means situated atsaid teleconferencing station for receiving said at least a portion ofsaid image data and also for displaying a non-cartoon animatedtransmitted image corresponding to said at least a portion of said imageat said teleconferencing station during a teleconference such that whensaid transmitted image is displayed at the teleconferencing station, atleast a portion of said image complements said motif and anyparticipants in the image appear in the presence of the participants atthe teleconferencing station.
 60. The teleconferencing system as recitedin claim 59 wherein said teleconferencing system furthercomprises:differentiating means for receiving said image data and fordifferentiating said image data to provide differentiated image data.61. The teleconferencing system as recited in claim 59 wherein saidteleconferencing system further comprises:processing means forprocessing said image data to provide differentiated image data, saiddifferentiated image data excluding an unwanted portion of said image.62. The teleconferencing system as recited in claim 59 wherein saidimage comprises an unwanted portion, said system further comprising:adifferentiator for receiving said image data and for removing saidunwanted portion from said image data.
 63. The teleconferencing systemas recited in claim 59 wherein said at least a portion of said imagecomprises at least one participant.
 64. The teleconferencing system asrecited in claim 62 wherein said unwanted portion comprises abackground.
 65. The teleconferencing system as recited in claim 59wherein said transmitting means further comprises:a compressor forcompressing said at least a portion of said image data prior totransmission to said teleconferencing station.
 66. The teleconferencingsystem as recited in claim 59 wherein said teleconferencing systemfurther comprises:a compositor situated at said teleconferencing stationfor combining said at least a portion of said image data with a secondset of data corresponding to a second image to provide combined imagedata; said display means displaying a combined image corresponding tosaid combined image data at said teleconferencing station.
 67. Theteleconferencing system as recited in claim 60 wherein saidteleconferencing system further comprises:a compositor for combiningsaid differentiated image data with a second set of data correspondingto a second image to provide combined image data, said second imagehaving a resolution which is higher than a resolution of saiddifferentiated image.
 68. The teleconferencing system as recited inclaim 59 wherein said teleconferencing system further comprises:acompositor for combining said at least a portion of said image data witha second set of data corresponding to a background of saidteleconferencing station.
 69. The teleconferencing system as recited inclaim 59 wherein said generating means further comprises:video means forcapturing said image at said remote station and also for removing anunwanted portion of said image.
 70. The teleconferencing system asrecited in claim 69 wherein said video means further comprises:adifferentiator for removing an unwanted portion of said image prior tosaid transmitting step.
 71. The teleconferencing system as recited inclaim 59 wherein said display means further comprises:a rear-projectionscreen situated at said teleconferencing station.
 72. Theteleconferencing system as recited in claim 59 wherein said displaymeans further comprises:a screen situated at said teleconferencingstation; a participant table situated adjacent said screen andconfigured to cause said participants to be situated a predetermineddistance from said screen.
 73. The teleconferencing system as recited inclaim 72 wherein said screen is a rear-projection screen.
 74. Theteleconferencing system as recited in claim 72 wherein saidpredetermined distance is not less than about 5 feet 6 inches.
 75. Theteleconferencing system as recited in claim 72 wherein said participanttable comprises a convex edge opposite said screen.
 76. Theteleconferencing system as recited in claim 59 wherein saidteleconferencing system further comprises:an enhancer situated at saidteleconferencing station for enhancing said at least a portion of saidimage displayed at said teleconferencing station from a first resolutionto a second resolution, wherein said second resolution is higher thansaid first resolution.
 77. The teleconferencing system as recited inclaim 59 wherein said at least a portion of said image is displayed suchthat subjects in said image appear substantially full scale at saidteleconferencing station.
 78. The teleconferencing system as recited inclaim 59 wherein said transmitting means transmits said at least aportion of said image data at a rate of at least 1.5 megabytes persecond.
 79. The teleconferencing system as recited in claim 59 whereinsaid teleconferencing station comprises a predetermined motif.
 80. Theteleconferencing system as recited in claim 79 wherein saidpredetermined motif comprises a roman motif.
 81. The teleconferencingsystem as recited in claim 59 wherein said teleconferencing stationfurther comprises a plurality of subjects which define a predeterminedsensory setting.
 82. The teleconferencing system as recited in claim 81wherein said teleconferencing station comprises a plurality of subjectsincluding at least one of the following: a pillar, a plant, a table, awall decoration or a carpet arranged to provide a predetermined motif.83. The teleconferencing system as recited in claim 59 wherein saidteleconferencing station comprises a modular construction defining ateleconference environment comprising dimensions of at least 20 feet×20feet×9 feet.
 84. The teleconferencing system as recited in claim 59wherein said generating means further comprises:a camera situated behinda teleconferencing screen at a remote station; said teleconferencingscreen comprising an aperture through which said camera captures saidimage at said remote station.
 85. The teleconferencing system as recitedin claim 84 wherein said generating means further comprises:a pluralityof cameras situated at a remote station for generating said image data.